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Abstract 

We initiate a study of random instances of nonlocal games. We show that 
quantum strategies are better than classical for almost any 2-player XOR game. 
More precisely, for large n, the entangled value of a random 2-player XOR game 
with n questions to every player is at least 1.21... times the classical value, for 
1 — o(l) fraction of all 2-player XOR games. 

1 Introduction 

Quantum mechanics is strikingly different from classical physics. In the area of 
information processing, this difference can be seen through quantum algorithms 
which can be exponentially faster than conventional algorithms [321 Hi] and 
through quantum cryptography which offers degree of security that is impossible 
classically [5]. 

Another information-theoretic way of seeing the difference between quan- 
tum mechanics and the classical world is through non-local games. An example 
of a non-local game is the CHSH (Clauscr-Horne-Shimony-Holt) game This 
is a game played by two players against a referee. The two players cannot com- 
municate but can share common randomness or a common quantum state that 
is prepared before the beginning of the game. The referee sends an indepen- 
dent uniformly random bit to each of the two players. Each player responds 
by sending one bit back to the referee. Players win if a; © y = i A j where i,j 
are the bits that the referee sent to the player and x, y are players' responses. 
The maximum winning probability that can be achieved is 0.75 classically and 
I + — 0.85... quantumly. 

There are several reasons why non-local games are interesting. First, CHSH 
game provides a very simple example to test the validity of quantum mechanics. 
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If we have implemented the referee and the two players A, B by devices so 
that there is no communication possible between A and B and we observe the 
winning probability of 0.85..., there is no classical explanation possible. Second, 
non-local games have been used in device- independent cryptography [TJ I25j . 

Some non-local games show big gaps between the classical and the quan- 
tum winning probabilities. For example, Buhrman et al. [7] construct a 2- 
player quantum game where the referee and the players send values x,y,i,j G 
{1, . . . , n} and the classical winning probability is 5 while the quantum 

winning probability is 1. In contrast, Almeida et al. \2i construct a non-trivial 
example of a game in which quantum strategies provide no advantage at all. 

Which of those is the typical behaviour? In this paper, we study this ques- 
tion by looking at random instances of non-local games. 

More specifically, we study two-party XOR games. This is a subclass of 
non-local games with 2 players, where the referee randomly chooses inputs 
i G {1,2,..., n}, j e {1,2,..., k} and sends them to the players. The players 
reply by sending bits x and y. The rules of the game are specified by an n x fc 
matrix A whose entries are -1-1 and —1. To win, the players must produce x 
and y with a; = y if Aij = 1 and x and y with x ^ y if Aij = — 1. 

We consider the case when the matrix A that specifies the rules of the game 
is chosen randomly against all ±l-valued n x k matrices A. For the case when 
n = k, we show that 

• The maximum winning probability pq that can be achieved by a quantum 
strategy is | + ^^^"^ with a probability 1 — o(l); 

• The maximum winning probability pd that can be achieved by a classical 
strategy satisfies 

1 0.6394... - 0(1) 1 0.8325... + 0(1) 

2 2 ^/ri 

with a probability 1 — o(l). 

In the literature on non-local games, one typically studies the difference between 
the winning probability pq (pd) and the losing probability 1 — pq {I — pd): 
Aq — 2pq — 1 (Ac/ = 2pci — 1). The advantage of quantum strategies is then 
evaluated by the ratio For random XOR games, our results imply that 

1.2011... < -r^ < 1.5638... 

Ad 

for almost all games. Our computer experiments suggest that, for large n, 
« 1.305.... For comparison, the biggest advantage that can be achieved 
in any 2-player XOR game is equal to Grothendieck's constant Kq [H] about 
which we know that [TBI Ell IS] 

1.67696.... <Kg< 1.7822139781... 

Thus, the quantum advantage in random XOR games is comparable to the 
maximum possible advantage for this class of non-local games. 
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We find this result quite surprising. Quantum-over-classical advantage usu- 
ally makes use of a structure that is present in the computational problem 
(such as the algebraic structure that enables Shor's quantum algorithm for fac- 
toring |24jV Such structure is normally not present in random computational 
problems. 

The methods that we use to prove our results are also quite interesting. 
The upper bounds are easy in both classical and quantum case but both lower 
bounds are fairly sophisticated. The lower bound on the classical value of 
random XOR games requires a subtle argument that reduces lower-bounding 
the classical value to analyzing a certain random walk. The lower bound for 
the entangled value requires proving a new version of Marcenko-Pastur law |18) 
for random matrices. 

Related work. Junge and Palazuelos [17] have constructed non-local 
games with a big gap between the quantum (entangled) value and the clas- 
sical value, via randomized constructions. The difference between this paper 
and \171 is as follows. The goal of [T7] was to construct a big gap between the 
entangled value and the classical value of a non-local game and the probability 
distribution on non-local games and inputs was chosen so that this goal would 
be achieved. 

Our goal is to study the behaviour of non-local games in the case when the 
conditions are random. We therefore choose a natural probability distribution 
on non-local games (without the goal of optimizing the quantum advantage) 
and study it. The surprising fact is that a substantial quantum advantage still 
exists in such setting. 

2 Technical preliminaries 

We use [n] to denote the set {1, 2, . . . , n}. 

In a 2-player XOR game, we have two players A and B playing against a 
referee. Players A and B cannot communicate but can share common random 
bits (in the classical case) or an entangled quantum state (in the quantum 
case). The referee randomly chooses values i £ {1, . . . , n} and j G {1, . . . ,n} 
and sends them to A and B, respectively. Players A and B respond by sending 
answers x € {0, 1} and y S {0, 1} to the referee. 

Players win if answers x and y satisfy some winning condition P{i, j, x,y). 
For XOR games, the condition may only depend on the parity x(Sy oi players' 
responses. Then, it can be written as P{i,j,x ® y). 

For this paper, we also assume that, for any i, j, exactly one of P{i, j, 0) and 
P{i,j, 1) is true. Then, we can describe a game by an n x n matrix (^ij)f 
where A^j = 1 means that, given i and j, players must output x, y with x(Sy — 
(equivalently, x = y) and Aij = — 1 means that players must output x^y with 
x®y = \ (equivalently, x ^ y). 

Let ps,win be the probability that the players win if they use a strategy S 
and ps.ios = 1 — Ps,win be the probability that they lose. We will be interested 
in the difference A5 = Ps,win ~ Ps.ios between the winning and the losing 
probabilities. The classical value of a game, A^;, is the maximum of A5 over 
all classical strategies S. The entangled value of a game, Aq, is the maximum 
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of A5 over all quantum strategies S. 

Let pij be the probability that the referee sends question i to player A and 
question j to player B. Then \lOi section 5.3], the classical value of the game 
is equal to 

n 

Aci = max max > pijAijUiVj. (1) 

In the quantum case, Tsirelson's theorem [8 implies that 

n 

^^if 1 max Vp.yA„(Mi,Wj) (2) 

where the maximization is over all tuples of unit-length vectors wi, . . . , u„ G i?'', 
Vi, . . . ,Vn G R'^ (in an arbitrary number of dimensions d). 

We will assume that the probability distribution on the referee's questions 
i, j is uniform: pij — ^ and study Ac; and for the case when A is a random 
Bernoulli matrix (i.e., each entry Aij is +1 with probability 1/2 and —1 with 
probability 1/2, independently of other entries). 

Other probability distributions on referee's questions can be considered, as 
well. For example, one could choose yij to be normally distributed random 
variables with mean and variance 1 and take pij = j2"^^'^'\v I ' more 
generally, one could start with yij being i.i.d. random variables from some 
arbitrary distribution D and define Pij in a similar way. 

Most of our results are still true in this more general setting (with mild 
assumptions on the probability distribution D). Namely, Theorem [l] and the 
upper bound part of Theorem H] remain unchanged. The only exception is the 
lower bound part of Theorem ^ which relies on the fact that the probability 
distribution p^j is uniform. It might be possible to generalize our lower bound 
proof to other distributions D but the exact constant in such generalization of 
our lower bound could depend on the probability distribution D. 



3 Quantum upper and lower bound 

Theorem 1 For a random 2-player XOR game with n inputs for each player, 

_2±o(l) 
Vn 

with probability 1 — o(l). 

Proof: Because of ([2|, proving our theorem is equivalent to showing that 



1=1 j=i 

holds with probability 1 — o(l). 



,m,^^^„ Yl Yl ^j) = (2 ± 0(1)) 

^ k; .,■ ^ 1 ^_ ^ ^_ ^ 
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For the upper bound, we rewrite this expression as foUows. Let m be a 
vector obtained by concatenating all vectors Ui and z; be a vector obtained by 
concatenating all Vj. Since \\ui\\ = \\vj\\ = 1, we have = = We 
have 

n n 

^^A,j{u,,Vj) = {u, < ||m|| • \\A(^I\\ ■ \\v\\ < \\A\\n. 

i=i j=i 

By known results on norms of random matrices |29) . \\A\\ = (2 + o{l))^/n with 
a high probability. 

For the lower bound, we note that 



II >ll II J II = i " 3"- 1=1 j = l 



We have 

Theorem 2 (Marcenko-Pastur law, [18j) Let A he any. n random matrix 
whose entries Aij are independent random variables with mean and variance 
1. Let C G [0, 2]. With probability 1 — o(l), the number of singular values A of 
A that satisfy A > Cy/n is (/(C) — o(l))n where 




— Idx. 



Let Ai, Am be the singular values of A that satisfy Aj > {2 — e)y^. With 
high probability, we have m £ [(/(2 — e) — o{l))n, (/(2 — e) + o(l))n]. We now 
assume that this is the case. 

Let li and be the corresponding left and right singular vectors: Art — XiU. 
(Here, we choose li and so that \\li\\ — ||rj|| = 1 for all i.) Let Uj and be 
the components of h and rj: U = {lij)J^i and r.; = {rij)^^i. 

We define uj and Vj in a following way: 

Uj = {hj)i=l: Vj — {rij)i^i. 

We have 

n n n n m 

X X "i) = X X X a^^fc^^feJ 

i=l j = l i=l j = l A: = l 

m m 

= ^(;fe,Arfc) =^Afc > (2-e)mVn. (3) 

k=l k=l 

Since — \\ri\\ — 1 and the vectors Ui and Vj are obtained by rearranging 
the entries of li and r^, we have 

n n 

1=1 1=1 



5 



and, similarly, H'l'ilP = ni. If Ui and u,; all were of the same length, we would 
have IIujIP = \\vi\\'^ = ^. Then, replacing and by u- = and = 
would increase each vector ^/~^ times and result in 



5^^a.,K,z;;.)>(2-e) 



To deal with the general case, we will show that almost all Ui and Vi are of 
roughly the same length. Then, a similar argument will be used. The key to 
our proof is a new modification of Marcenko-Pastur law. 

Theorem 3 (Modified Marcenko-Pastur law) Let A be an n x n random 
matrix whose entries Aij are independent random variables with mean and 
variance 1. Let C £ [0,2]. Let be the i^^ vector of the standard basis. Let 
Pc be the projector on the suhspace spanned by the right singular vectors with 
singular values at least Cy^. Then, 



Pr[\\\Pce,r-f{C)\>e]^o(^^^ 



with the big-0 constant depending on C and e. 

The same result also holds for the left singular vectors. 
Proof: In appendix [A| | 

We now complete the proof, assuming the modified Marcenko-Pastur law. 
Since Pc is spanned by the right singular vectors ri , . . . , , we have 

m m 

wpce^r ^Y.('-^,e,f ^Y.^%^\\v.r- (4) 

Therefore, the modified Marcenko-Pastur law means that 
Pr[\\v,f> f{2-e) + S]=o(- 



Thus, the expected number of i e {!,..., n} for which > /(2 — e) + 5 is 

0(1). We now apply the following transformations to vectors vf. 

1. For each Vi with \\vi\\^ > f{2 - e) + S (or Ui with > /(2 - e) + S), 
we replace it by the zero vector ; 

2. We replace each Vi by 

, Vi 



and similarly for Ui. 

After the first step < /(2 — e) + 6 for all i. Hence, after the second step, 

\\v'i\\^ < 1 for ah i. 

We now bound the effect of those two steps on the sum 

n n 

^^aij{ui,Vj). 
1=1 j=i 
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Because of ([s]), the initial value of this sum is at least 

(2 - t)m^ > (2 - e)(/(2 - e) - o(l))n^/2_ (^5) 

Because of = ||Pcej|P < ||ej|p ^ 1. Similarly, < 1. Hence, 

< 1 and replacing one Vj (or Wi) by changes the sum by at most 
X]"=iJ^iil = Replacing 0(1) Vj's (or m^'s) changes it by 0(n). Since the 
sum ([5| is of the order 8(ri'^/^), this is a lower order change. 

Replacing u^'s by w"s (and u^'s by similarly defined mJ's) increases each inner 
product (ui , Vj ) j(-2_^e)+i5 times and achieves 

Since this can be achieved for any fixed e > and (5 > 0, we get that 



n n 



max > > a,j(u^,i;') > (2-o(l)) 



4 Classical upper and lower bound 

In the classical case, we have to estimate 



max 



s max > AiMiV^. (6) 



There are several ways how one can interpret this expression. First, ([6]) is equal 
to the Ico ~^ h norm of A (denoted ||^||oo->-i)- It is known that, for a random 
matrix A, ||yl||oo^i = Q{ny/n) (e. g., from |2S]). but the exact constant under 
O is not known. 

Gittens and Tropp [12^ show that, if yl is a matrix whose entries are i.i.d. 
random variables, then 



II^IU^i <2^(P||,„, + ||A' 



col I 



where || A||coi denotes the sum of the I2 norms of the columns of A. For our case, 
this gives an upper bound of |jA|joo->.i < 'in^/n which is substantially weaker 
than our Theorem |4] below (and would be insufficient to show a gap between 
classical and entangled values for random XOR games). 

In the context of statistical physics, there has been substantial work on 
determining the order of 

n 

max > AijUiUj (7) 
«i,...,u„G{-i,i} ■' 

when Aij is a symmetric Gaussian matrix (each Aij = Aji is an independent 
Gaussian random variable with mean and variance 1). It is known that ([t]) is 
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equal to (1.527... +o(l))n'^/2 -with probability 1 — o(l). This was first discovered 
in [23l [2Tj and rigorously proven by Talagrand ^28] . 

The quantities Q and Q are of similar flavour but are not identical and 
the work on Q does not directly imply anything about our problem, b 

One can also interpret (|6| combinatorially, as a problem of "unbalancing 
lights" In this interpretation, n x n matrix represents an array of lights, 
with each light being "on" {Aij — 1) or "off' {Aij = — 1). We are allowed to 
choose a row or a column and switch all lights in this row or column. The task 
is to maximize the difference between the number of lights that is on and the 
number of lights that is off. It is known that for any n x n matrix A with ±1 

entries, (6) is at least ^J~^n^^'^ [3, p. 19]. We were not able to find any work on 
evaluating ([6| for a random matrix A in this context. 

Theorem 4 For a random 2-player XOR game, its classical value Ad satisfies 

1.2789... . 2Vln2 + o{l) 1.6651... + o(l) 
-1= — < < -1= = j= 

with probability 1 — o(l). 

This is equivalent to 

1.2789...n3/2 < ||A|loo->i < 1.6651. ..n^/^ 

for a Bernoulli random matrix A. 

In computer experiments, the ratio ^^^^g'^^"^'" grows with n and reaches 1.4519... 
for n = 26. By fitting a formula an^^"^ + bn where the leading term is of the 
order n^^^ and the largest correction term is of the order n to the data, we 
obtained that 

Plloo^i ~ 1. 53274.. .^3/2 _ 0.472806.. .71. 

Figure [I] shows the fit. Curiously, the constant in front of 7i'^/2 jg very close 
to the constant 1.527... for the sum ^. We are not sure whether this is a 
coincidence or there is some connection between the asymptotic behaviour of 
the two sums. 

Proof: The upper bound follows straightforwardly from Chernoff bounds. We 
use the following form of Chernoff inequality: 

Theorem 5 p. 263] Let Xi, . . . , Xn be independent random variables with 
Pr[X, = 1] = Pr[X, = -1] = i andlet X = Xi + + X^. Then, 

Pr[X >a]< e^tl. 

Let xi,...,Xn G { — 1,1} and yi,...,7/„ e { — 1,1} be arbitrary. If Aij e 
{— 1, 1} are uniformly random, then AijXiyj G {—1, 1} are also uniformly ran- 
dom. Hence, ^ AijXiyj is a sum of 7i2 uniformly random values from {—1, 1}. 
By Theorem [5j 



Pr 



3 
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< e" 



1 
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Figure 1: ||^||oo-5>i, for random n x n matrices A 



By taking C — 2\/ln 2 + 2 , we can ensure that this probability is less than 
22n„2 ■ Then, by the union bound, the probability that ^ AijXiyj > Cn^ for 
some choice of Xi's and y/s is less than 2^" 2^''n'^ ~ n^"- 
To prove the lower bound, we first show 

Lemma 1 Let A be an n x n random Bernoulli matrix. Then, 



max y UiVjAi 



> (1.2789... -o(l))n3/2_ 



Let X — max„._„^.g{_i 1} X- ^ w^UjAij. By Lcmmajlj E[X] > (1.2789... — 
o(l))ri3/2. To prove that X > (1.2789... - o{l))n^/^ with probability 1 - o(l), 
we show that X is concentrated around -Ei^]. 

Lemma 2 Let X — max„. ij j UiVjAij for a random n x n matrix 

A. Then, 

Pr [\X - E[X] \ > an] < 26^"'/^ 

We then apply Lemma [2] with a — logn (or with a = f{n) for any other 
f{n) that has f{n) — >■ oo when n — >■ oo and f{n) = o{y'n)) and combine it with 
Lemma [T] 

It remains to prove the two lemmas. 
Proof: [of Lemma [I] Let v4 be a random ±1 matrix. We choose Ui and vj, 
according to Algorithm [T] 

Because of the last step, we get that 

n n n 
i=l j = l j = l 



Each of Sn j is a random variable with an identical distribution. Hence, 



E 



i=i j=i 



nE\Sn,i\- 



(8) 
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1. Set ui = 1. 

2. For each k = 2, . . . ,n do: 

(a) For each j = 1, . . . ,n, compute Sk-ij = J2i=i '^ij'^i- 

(b) Let ak = {Z{Sk-i,i), Z{Sk-i,2), Z{Sk-i-n)) where Z{x) = 1 if x > 0, 
Z{x) = —1 if X < and Z{x) = 1 or Z{x) = —1 with equal probabihty ^ 
if X = 0. 

(c) Let bk = {Aki,Ak2,---,Akn)- 

(d) Let Ui G {+1, —1} be such that ai and biUi agree in the maximum number 
of positions. 

3. For each j = 1, . . . ,n, let Vj be such that VjSnj > where Snj = X^ILi ^ij'^i- 

Algorithm 1: Algorithm for choosing Ui and Vj for a given matrix A. 



We now consider a random walk with a reflecting boundary. The random 
walk starts at position 0. If it is at the position 0, it always moves to the 
position 1. If it is at the position i > 0, it moves to the position i + 1 with 
probability ^ + | and position i — 1 with probability ^ — §■ Let be the 
position of the walker after i steps. 

Lemma 3 |S'n,i| — Kf^ for some e = (I + o(l)) 

Proof: hi — {An, . . . ,Ain) is a vector consisting of random ±l's that is inde- 
pendent of Qi. Hence, the expected number of agreements between and biUi 

is + |)n where e = (I + o{l))^J~^ p. 21]. Moreover, the probability of 
and biUi agreeing in location j is the same for all j. 

Hence, if > 0, we have |S'i^i| = |S'i_i,i| + 1 with probability 5 + f 

and li^i,!] — \Si^i_i\ — 1 with probability 5 — §. If |'S',;-i,i| = 0, then we always 
have 15^,1 1 = 1. | 

Lemma 4 For a random walk with a reflecting boundary and e = ^ , we have 
E[K^] > if (a) - o{l))y/Vi where 



/,,,=^.-#;?+„+(i+„)Erf(^)j. 

Proof: In appendix [b| | 

By combining ([s]) and Lemmas |3] and |4j the probability of winning minus 
the probability of losing in the classical case of a random XOR game is at least 




1 _ 2 + 2e-V- + (2 + ^)Erf(^ 
^2 ~ 2V2tt 



n 



= 1.2789076012442957.. .n" 
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Proof: [of Lemma [2] Let 

/(Aii,Ai2, . . . , A„„) = max y^ u^VjAij. 

ui,vje{—i.i} 

Then, changing one Aij from +1 to —1 (or from —1 to +1) changes ^ UiVjAij 
by at most 2. This means that /{An, . . . , Ann) changes by at most 2 as welL In 
other words, / is 2-Lipschitz. By applying Azuma's inequahty [121 p. 303-305] 
with c = 2, i = n^, A = |, we get 

Pr [|/(An, . . . , Ann) ~ . . . , A„„)]| > an\ < 2e-°'/8. 

I 

5 Conclusion 

We showed that quantum strategies are better than classical for random in- 
stances of XOR games. We expect that similar results may be true for other 
classes of non-local games. 

A possible difficulty with proving them is that the mathematical methods 
for analyzing other classes of non-local games are much less developed. There is 
a well developed mathematical framework for studying XOR games [8j [101 [SH] 
which we used in our paper. But even with that, some of our proofs were quite 
involved. Proving a similar result for a less well-studied class of games would 
be even more difficult. 

Acknowledgments. We thank Assaf Naor, Oded Regev and Stanislaw 
Szarek for useful comments and references to related work. 
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A Modified Marcenko-Pastur law 



In this section, we prove Theorem [3j 

Without loss of generality, we assume that i = 1. Let Ai, . . . , A„ be the 
singular values of A and let be the corresponding right singular vectors. Let 

= {ei\ri). 

Let X be a random variable that is equal to — ^ with probability \ai\ . Let 
y be a random variable distributed according to the Marcenko-Pastur law with 
parameters y = 1 and cr = 1 [31 Section 3.1]. Then, the probability density 
function of Y is equal to 

p(a;) = -^,/i-l 
^ ' 2n\l X 

for X e [0,4] and P{x) — otherwise. Let Fx{x) and Fy{x) denote the 
cumulative distribution functions of X and Y respectively. We would like to 
show that Fx{x) and Fy{x) are close to one another. 

We say that a family X of random variables is uniformly sub-Gaussian if 
there exist constants c abd C such that 

Pr[\X\ >x\< Ce-"^' 

for every X X. From Levy's continuity theorem [291 P- 99], we have 

Theorem 6 Let Y he a fixed random variable on some interval [a, b] and let 
X be a uniformly sub- Gaussian family of random variables. Let e > be fixed. 
There exist k > 0, 6 > such that if X Cz X satisfies 

\E[X"'] - ElY""]] < 6 (9) 

for all m G {1, . . . , k}, then we have 

\Fx{x)-Fy{x)\ <e 

for all X € [a, b] . 

We select a constant D so that 

Pr[\\A\\ > Dy^i] =0 

By concentration results for norms of random matrices (Proposition 2.3.10 of 
^\), D = 2 + o(l) suffices. We define X as the family of random variables 

X, for all A that satisfy ||^|j < Dsfn. Since X = ^ and Ai < ||A||, we have 
Pr\X > D^] = 0. Hence, X is uniformly sub-Gaussian because constants C, c 
can be chosen so that Ce~'^^ > 1 for a; : < a; < D . 

To show theorem [3 it suffices to prove that the random variable X ^ X 
satisfies the condition (9| for all me {1,2,..., k}, with probability 1 — 0{^). 
Since k is fixed, this is equivalent to showing ^ for each to S {1, . . . , fc} with 
probability 1 — O(^). For the random variable Y, its moments are quite well 
known: 
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Lemma 5 

where Cm = rn\(m+iy. denotes the m*'' Catalan number. 

Proof: From [4, Lemma 3.1], we have = Y^^i N{m,i) where N{m,i) 

are Narayana numbers N{m,i) — From pT], page 237], we have 

YZiN{m,i) = Cm. I 

We introduce a random variable Mm that is equal to i?[X™] for a random 
matrix A. To show that Mm S [Cm — Cm + ^] with a high probability, 
we bound the expectation and the variance of Mm and then use Chebyshev's 
inequality. We have 

Lemma 6 Let m > be fixed. Then, for large n, 

E[Mm] =(^1 + 0Q^^^ Cm- 

Proof: Let B = A. Then, the eigenvalues of B are Af , . . . , A^. We have 

1 " 1 

^'im = —Y.>^}-\a^'^—{e,\B-\e,). 

i=l 

By expanding (ei|-B™|ei), we can write 

m— 1 



~ ^ X! "ii,iai2™-i4 n a»23-i,*2j"i2j+i,»2r (10) 

il,---,i2m-l = l j = l 

E[Mm] is equal to times the sum of expectations E\T], 

m — 1 

T flii,iai2„_i,l ^i2j-l,i2]^i2j + l:i2j ■ (11) 

J = l 

If T consists of a^j./tj, 0^2.^21 •■•7 Qji.fei occurring ci , C2 Q times, we have 

i?[T]=£;[a-^,Ji?[a-,,J...i?[a-,J. (12) 

If Ci is odd for some z, then a^' is 1 with probability 1/2 and -1 with probability 
1/2. Hence, E[aj^ ] = 0. If all q are even, then aj' /j. =1 and (12) is equal to 
1. Thus, (10 1 is equal to the number of terms T in which each Uj.^ki occurs an 
even number of times. We call such terms good. 

For a term T, let |r| to be the number of different numbers appearing in 
the sequence 1, ii, 12, . . . , i2m-i- We say that a good term T is standard if 
the sequence 1, ii, j2, ■ • ■ , *2m-i is such that the i*"^ different number of this 
sequence is equal to i, for every i € {2, . . . , If we have a standard term T, 
we can obtain (n — l)(n — 2) . . . (n — |T| + 1) good terms from it, by replacing 
all occurrences of 2, . . . , |T| with distinct numbers j2,---ij\T\ G {2, •..,«}. 
Moreover, each of those good terms can be obtained only from one standard 
term. 



15 



Therefore, we have 

^[^"l^;^ E (^-l)(r^-2)...(n-|T| + l). 
T-standard 

Since the sequence l,ii,i2, . . . ,«2m-i rnay contain at most 2to different num- 
bers, we have |r| < 2m. Since m is a fixed constant, this imphes 

T-standard 

Let t be the largest possible value for \T\. Then, 

E\M^] = — ( 1 + O 1 - I I n*"^ \{T ■■ T- standard. ITI = t}\ . (13) 
n™ \ K''^ J J 

The contributions from standard terms T with |T| < t can be absorbed into 
the O(-) factor, since n'^'"^ < = -n*~^ for each of those terms and the 
number of standard terms is a constant (it depends only on t and not n). 

It remains to determine t and to count the standard terms T with |T| = t. 
We claim that t = m + 1. 

Let r be a standard term. Consider the graph G with the set of vertices 
V = {l,...,|r|} and the set of edges E consisting of all the different pairs 
that appear in the sequence («i,l), (ii,i2), (*3,«2), (i2m-i,l)- Since 
T is good, each such (?, j) appears an even number of times, i.e. at least twice. 
Therefore, \E\<'^^ m. 

The graph G is connected. Hence, |y| < + 1 < m + 1. Moreover, 
\V\ = m-|- 1 is achieved, for example, by the graph G obtained from a standard 
term 

(2, 1), (2, 1), (3, 1), (3, 1), . . . , (to + 1, 1), (to + 1, 1) 

in which i2j — 1 and i2j-i = J + 1. It remains to count the standard terms T 
with |r| = TO + 1. 

For each such term, we have \V\ = to + 1 and \E\ = to. Hence, G must 
be a tree, with each edge occuring exactly twice in the sequence (ii, 1), (ii, ^2): 
(is, 12), . . ., (i2m-i, 1)- We now consider the sequence 

(l,il), (11,^2), (42, is), • • • , («2m-l, 1) 

in which all edges are directed so that the sequence is a closed walk in the 
tree G. The conditions on this sequence that we have are the same as the 
conditions on a non-crossing cycle in [27, p. 144]. As shown in [291 p. 144-145], 
the number of non-crossing cycles of length 2m is equal to the Catalan number 
Cm- The lemma follows by substituting this and t = m + 1 into equation ( 13 ). 

I 

By a similar argument, we have 
Lemma 7 Let m > be fixed. Then, for 

E[Mi]^(l + o(l)){Cmr 
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Proof: The proof is similar to the proof of Lemma|6] We omit it in this version 
of the paper. | 

From Lemmas [6] and [Tj we get the following corollaries. 

Corollary 1 Let m > be fixed. Then, for large n, 

D[Mm] = O (^^ Cl. 

Proof: FoUows from D[X] = ^[X^] - E'^[X], Lemma[6]and LemmajTj | 
Corollary 2 Let m > and 5 > he fixed. Then, for large n, 

Pr[\Mm-Cra\>5] = o{^^. (14) 

Proof: Follows from Corollary [l] and Chebyshev inequality. | 
This completes the proof of Theorem [3] 

B Analysis of a random walk with a reflecting 
boundary 

B.l Proof overview 

In this section, we prove Lemma |4] To simplify notation, we write Ki instead 
of Kl . We have 

E[K,+^] = Pr[K, > Q\E[K,+^\K, > 0] + Pr[K, = Q\E[K,+^\K, = 0]. 
If Ki = m and m > 0, we have 

E[K,+,\K, = ™] = + ("^ + 1) + Q - (m - 1) = m + e. 

We also have E[Ki^i\Ki = 0] = 1. Hence, 

E[K,+,] = E[K,] + e + (1 - e)Pr[K, = 0]. 
By induction, this implies 

n-l 

E[Kn] = en + (1 - e) ^ Pr [K, = 0] . 

1=0 

We have en = a^/n. It remains to bound X]"=7o^ P^' = 0]. The first step is 
to express this sum in terms of binomial coefficients. 

Claim 1 

.A} \A-\ 



z=0 



S=0 A=max(0,S-L^J+l) 



where p ~ + f • 
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We can express this sum as a difference of two sums 

2L^J-2 LfJ 

.Aj \A-1 



S'=0 A=max(0,S-L=^J+l) 
2L^J-2 LfJ 

.A} \A-l 



5=0 A=0 

2L^J-2S-L^J 

, ,A) \A-l 

Claim 2 

2L^J-2 LfJ 



E E((!)-f/,))!>-^(i-^)'-"s 



S=0 A=0 



where 



(1 + o(l)) ( . - a + ( - + a ) Erf 



Erf(z) = ^ / e^^^dt. 
a/tt 



727 



zs i/ie error Junction for the normal distribution. 
Claim 3 

2L^J-2S-L^J 

, ,A) \A-l 
Lemma |4] now follows by combining Claims [T] [2] and |3] 
B.2 Proof of Claim H 

Let T(to, be the number of paths of length 2m that start and end at the point 
0, increase or decrease the coordinate by 1 at each step, never take a negative 
coordinate and return to the location t times (including the last step). Then, 
for any m > and i : 1 < i < m, we have [13l p. 203], 

t /2m - < - 1\ 
^("^'^)-m( m-t )■ 

We have 

n-l l^i 

^ Pr [X, - 0] = 1 + 2 Pr [K2m = 0] = 

i=0 m=l 
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1+ E E^(™'^)p'""'(i-^')' 



m=l t=l 



m=l t=l 



Here, the first equality follows from the fact that the random walk can return 
to location only after an even number of steps. The second equality follows 
by partitioning the paths that return to after 2m steps according to the total 
number of times the path returns to (including the final return to 0). If the 
path returns to < times, then it also leaves t times. Hence, there are t steps 
in which we move right with probability 1 (the steps which start at location 
0) . There are also m — t other steps when the path moves right (each of those 
steps is taken with probability p) and m steps when the path moves left (each 
of those steps is taken with probability 1 —p). This means that the probability 
of each path with t returns is p™~*(l — p)™. The third equality follows from 



Let S = 2m — t — 1 and A = m — t. We can rewrite the sum (16) in the 
following way: 

L-^M"J ™ /r, J- 1\ J- 

'2m — t — 1\ t 



m=l t=i 



2L^J-2 LfJ 



s=o 



A=max(0,S- +1) 



2L=i^J-2 LfJ 



i: E a - 

S=0 A=max(0,S-L^J+l) ^ ^ ^ ^ 

where the last equality follows from 

'S\S-2A + 1 fS\ ( S 



S-A+l 



^AJ S-A + 1 \AJ \A-1 
This completes the proof of the claim. 

B.3 Proof of Claim [2] 

We denote m — 2[^^yiJ — 2. By using the identity 

k 



A=0 
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we obtain 



m L 2 -I 

EE 

S=0 A=0 



S 

A-l 



S 



™ LfJ-i 



S-A 



E i5iK^^(i-p)^^^-2e(i+o(i))E E ( Mii'pf-^- 

S=0 ^L2J/ A=0 

Claim [2] now follows from the following two claims. 
Claim 4 



Claim 5 

m Ll" J ~ 1 

2^E E 

S=l A=0 



.2 2 



TT \a 



a Erf 



Proof: [of Claim [4] We have 

(JjyfJ(i-p)rfi=(i + o(i))-^2^(p(i-p))^/^ 



= (l + o(l))^e--^ 



(18) 



where 7 



' In 4p(l— p) 



Here, the first equality follows from approximations of 



binomial coefficients. We denote W{n, S) 



_ V2e 



/ttS 



The identity ( 18 ) means 



that, for every 6 > 0, there exists Sq such that 
for all S > So- Therefore, we have 



E 

s=o 



S 



l(l_p)rfl > (l-,5) ^ Win,S) 



S=So 



> (l-(5) / W{n,S)dS 



{l-S)^ (Erf(V7^) - Erf(VT^)) 



(19) 



Here, the inequality follows from the function W{n, S) being decreasing in 5. 



The last equality follows from / ^^dt 
_ -ln4p(l -p) -ln(l - e2) 



For large n, we have 



In 1 - ^^ 

n 

1 



= (l + o(l)) 



2n' 
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We also have 7m = (1 + o{l))^ = (1 + o{l))^ and 7^0 = (1 + 0(1))^^^ = 
0(1). Hence, (19 1 is equal to 



(l-5)(l + o(l))^Erf 
a 



a 

71 



Since this is true for any S > 0, the claim follows. | 

Proof: [of Claim [5] We can interpret X]!4=o^^ {a)p^(^ — p)^~^ as follows. 
Let X — Xi + ... + Xg where Xi are independent random variables with 

Pr[X, = l]=p and Pr[X, = -l] = l-p. Then, 



LfJ-i 



A=0 



S . 



x< - S-[-\ + l 



s ^ 



< Pr[X < 0] 



By Central limit theorem, for large S, Pr[X < 0] tends to 

^Erfc ( ^ lErfc f 
where Erfc(a;) = 1 — Erf(a;). Since p ~ ^ + ^, e — this is equal to 



2 I y2T^ 



(l + o(l))^Erfc^ 



Hence, 



m Lfj-1 



S'=l A=0 



S 



5=1 



"71" 



< (1+0(1))- 



Erfc 



/2n 



dS 



= (l + o(l))V^ a-e 



Here, the inequality follows from Erfc being decreasing and m < n and the last 
equality follows from 



y Erfc(a\/t) 



Erf(aVt) Vte-'' * , ^ r^^ 
dt ^ ^ ^ ' - ^ + t{l - Erf(av^)). 
za^ vvra 



B.4 Proof of Claim I 



Let m — [^^J. Because of (17 1, we have 



2m — 2 S—m 

E E 

S=m A=0 



S 

A) \A-1 



P^{l-P) 



S-A 
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2m-2 / ^ 2m-2 5-m-l 



S=m ^ ' S=m A=0 

2m-2 



\S-A 



S—m 



S 



where the last inequahty follows from 

that holds if 2 A < S. Claim [3] now follows from 
Claim 6 Let m be a positive integer. Then, 

Proof: We give a combinatorial proof of this equality. 

Consider a random process where we have two boxes each containing m + 1 
pebbles. At each step we choose one of the two boxes with equal probability 
i and remove one pebble from this box. The process finishes when we remove 
the last pebble from one of the two boxes. 

We calculate 

Pr [The process finishes after k steps] = 

2Pr [The process finishes after k steps and the first box is empty] 

To empty the first box we need to choose the first box in m out of fc — 1 
steps which happens with a probability of C^"^) 2¥=t and in the last step we 
need to choose the first box which happens with a probability of \ . Thus 



Pr [The process finishes after k steps] — 2 



fc - 1\ 1 1 /fc - 1\ 1 



2'=-! 2 V m / 2^-1 



The number of steps k can be any value from to + 1 to 2to + 1 including. 
Because the process must end after some number of steps we obtain 



Claim |3] follows by substituting k — S 



^ V TO ) 2*^-1 ^' 
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